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Abstract. In this paper, we investigate the Merton portfolio management problem 
in the context of non-exponential discounting. This gives rise to time-inconsistency 
of the decision-maker. If the decision-maker at time t = can commit his/her suc- 
cessors, he/she can choose the policy that is optimal from his/her point of view, and 
constrain the others to abide by it, although they do not see it as optimal for them. 
If there is no commitment mechanism, one must seek a subgame-perfect equilibrium 
strategy between the successive decision-makers. In the line of the earlier work by 
Ekeland and Lazrak [5] we give a precise definition of equilibrium strategies in the 
context of the portfolio management problem, with finite horizon, we characterize 
it by a system of partial differential equations, and we show existence in the case 
when the utility is CRRA and the terminal time T is small. We also investigate the 
infinite-horizon case and we give two different explicit solutions in the case when the 
utility is CRRA (in contrast with the case of exponential discount, where there is 
only one). Some of our results are proved under the assumption that the discount 



'•Work supported by PIMS under NSERC grant 298427-04. 



1 



2 



function h(t) is a linear combination of two exponentials, or is the product of an 
exponential by a linear function. 
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1. Introduction 

The discounted utility model (DU) has been in use since the beginning of economic 
theory. Landmark papers ar the ones by Ramsey in 1928 and Samuelson in 1937. 
There is by now a very rich literature, the common assumption being that the dis- 
count rate is constant over time so the discount function is exponential. The model 
makes it possible to compare outcomes occurring at different times by discounting 
future utility by some constant factor. A decision maker with high discount rates ex- 
hibit more impatience (care more about the future) than one with low discount rates. 
Most of financial-economics works have considered that the rate of time preference 
is constant (exponential discounting). However there is growing evidence to suggest 
that this may not be the case. Ainslie PQ performed empirical studies on human and 
animal behavior and found that discount functions are almost hyperbolic. Loewen- 
stein and Prelec [9] show four drawbacks of exponential discounting and propose a 
model which accounts for them. They discuss implications for savings behavior and 
estimation of discount rates. 

As soon as discounting is non-exponential, the decision-maker becomes time- 
inconsistent: a policy, to be implemented after time t > 0, which is optimal when 
discounted at time 0, no longer is optimal if it is discounted at a later time, for in- 
stance t. If the decision-maker at time can commit the later ones, that is, constrain 
them to follow the policy he/she has decided upon, then the policy which is optimal 
from his/her perspective can be implemented. But, apart for the dubious validity of 
doing that (why should his/her perspective be better than the one of decision- maker 
at time t, who, after all, will have to carry out a policy decided long before, and will 
be the one to bear its consequences ?) it is not often the case that management deci- 
sions are irreversible; there will usually be many opportunities to reverse a decision 
which, as times goes by, seems ill-advised. 

Existing Research 

Dynamic inconsistent behavior was first formalized analytically by Strotz [16] . 
Further work by Pollak [15], Peleg and Yaari [13], Goldmann [6] on this issue ad- 
vocates that the policies to be followed should be the output of an intra-personal 
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game among different temporal selves (today's self is a different player from tomor- 
row's self). Laibson [10] considers a discrete time consumption- investment economy 
without uncertainty. An agent (self) observes past consumption and financial wealth 
levels and chooses the consumption level for period t. They establish existence of 
a unique subgame perfect equilibrium. It is characterized by time-dependent con- 
sumption rules which are linear in wealth and should satisfy an Euler type equation. 
The work of Barro [2] is in a non-stochastic Ramsey model with logarithmic utility, 
and the discount function as a special form, whereby the rate of time preference is 
high in the near future but almost constant in the distant future. Krusell and Smith 
[8] also consider a discrete non-stochastic Ramsey paradigm with quasi-geometric 
discounting. They seek the equilibrium policy as the solution of a subgame-perfect 
equilibrium where the players are the agent and her future selves, and they show 
that there are several solutions to this problem. All this work is in the deterministic 
case, usually with discrete time. 

Ekeland and Lazrak [5J consider a deterministic problem with continuous time, 
namely the Ramsey problem of economic growth with non-exponential discounting. 
They define an subgame perfect equilibrium strategies by letting the decision maker 
at time t build a coalition with his/her immediate successors s, with s G [t, t + e] 
and by letting e — ► 0. They show that these strategies are characterized by two 
equivalent equations: a partial differential equation with a non-local term and an 
integral equation. The PDE coincide with the Hamilton- Jacobi-Bellmann equation 
of optimal control if the discounting is exponential. 

Our Contributions 

Dynamic asset allocation has been an area in finance which received a lot of at- 
tention lately. The ground-breaking paper in this literature is Merton [12]. He 
considered a model consisting of a risk-free asset with constant rate of return and 
one or more stocks, each with constant mean rate of return and volatility. An agent 
invests in this market and consume to maximize her expected utility of intertempo- 
ral consumption and final wealth. Merton was able to derive closed form solution 
for Constant Relative Risk Aversion (CRRA) and Constant Absolute Risk Aversion 
(CARA) preferences. It turns out for CRRA preferences, the optimal consumption 
and investment in the risky asset are a constant proportion of wealth. This is not 
the case for CARA although they are still linear in wealth. Karatzas et al. [7J, Cox 
and Huang [3| also solved the optimal investment and consumption problem by the 
static martingale method. In another paper Merton [11] also investigates the infinite 
horizon case. Although it yields the same quantitative results it is easier to handle 
- provided the constant discount rate satisfies a condition known in the literature as 
the transversality condition. It does not seem to have much empirical support, and 
what happens if it is not satisfied is a matter of debate. 
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All the papers mentioned above are in the exponential discounting paradigm which 
as we shown earlier was challenged by some economic literature. 

The goal of this paper is to understand how non-exponential discounting affects 
an agent's investment-consumption policies in a Merton model. First we show that 
doing naive optimization does not work in the absence of commitment technologies. 
Then we follow the approach of [5] and introduce the concept of equilibrium policies 
in a stochastic context. In the finite horizon case we give a first description of 
the equilibrium policies for a general discount function through the solutions of a 
flow of BSDEs. If the discount function is exponential, these conditions reduce 
to the classical HJB equation, so the equilibrium policy coincide with the optimal 
one given by dynamic programming. We then introduce special classes of discount 
functions, type I which is a linear combination of two exponentials, and type II which 
is the product of an exponential by a polynomial of degree one, and we show that 
for discount functions in that class, equilibrium solutions can be characterized by a 
system of HJB equations. We go one step further and characterize them through a 
parabolic PDE system. This system does not seem to have been considered before. 
Existence of a solution can be established in the case of CRRA utility. 

It is perhaps surprising that the same integral equation (IE) from |5J holds in this 
model for the special discounts. This suggests that (IE) holds for any discounts. We 
can see this for other types of discount but we are not concern with a general result 
in this direction. In an infinite horizon model the novel feature is stationarity and 
the key observation is, to borrow a sentence from [5], the decision maker at time t 
resets her watch so that time s becomes s — t, so she faces the same problem as the 
decision- maker at time 0. Keeping this in mind we define the equilibrium policies in 
this context. We follow the same approach as for finite horizon: first we describe them 
through the solutions of a infinite horizon BSDE and then for the special discounts 
through an ODE system and an integral equation. We end this case with the study of 
CRRA preferences were we find interesting results. As one might probably expected 
the equilibrium policies and optimal ones coincide if the discounting is exponential 
given the transversality condition of [11J. However we find a weaker condition to 
grant existence of an equilibrium policy so there may be circumstances when the 
equilibrium policy exists but the optimal one may not (in the sense that one can not 
perform the verification argument). Even more interesting for type I discounts we 
find instances when there are two equilibrium policies. Moreover transversality-like 
conditions were worked out. The equilibrium policies for CRRA preferences resemble 
the optimal ones. More precisely the agent invests the same constant proportion of 
her wealth in the risky asset as in the Merton problem and consumes also a constant 
proportion of her wealth which is different than the Merton one for non-exponential 
discounting. 

Organization of the paper 
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The reminder of this paper is organized as follow. In section 2 we describe the 
model and formulate the objective. Section 3 and 4 treat the finite horizon and 
infinite horizon problem. The conclusions are summarized in section 5. The paper 
ends with Appendix containing the proofs. 

2. The model and problem formulation 

2.1. Financial Market. We adopt a model for the financial market consisting of a 
saving account and one stock (risky asset). The inclusion of more risky assets can be 
achieved by notational changes. The saving account accrues interest at the riskless 
rate r > 0. The stock price per share follows an exponential Brownian motion 

dS(t) = S(t)[adt + adW(t)], < t < oo, 

where {W(t)}te\o,oo) is a 1— dimensional Brownian motion on a filtered probability 
space (f2, {J r t}o<t<T, J 7 , P). The filtration {J r t}o<t<T is the completed filtration gen- 
erated by {W(t)}t£[o,oo)- As usual a is the mean rate of return and a > is the 
volatility. Let us denote fi = a — r > the excess return. 

2.2. Investment-consumption policies and wealth processes. A decision-maker 
in this market is continuously investing her wealth in the stock/bond and is consum- 
ing. An investment-consumption policy is determined by the proportion of current 
wealth her/she invests in the bond/stock and consume (consumption rate). Formally 
we have: 

Definition 2.1. An IR 2 -valued stochastic process {((t), c(t)} t£ [ 0tOO is called an ad- 
missible policy process if it is progressively measurable, c(t) > 0, for all t e [0,oo) 
and it satisfies 

sup E[|C(t) \ m + c m {t)] < oo Vm=l,2,---, (2.1) 

0<t<T 

This condition originates from [14J and is needed since we are using their result. 

Given a policy process {((t), c(t)} te [ 0iOO ), the proportion of the wealth A^' c (t), at 
time t invested in the stock is ((t) and the consumption rate is c(t). The equation 
describing the dynamics of wealth X^' c (t) is given by 

dX c ' c {t) = X<' c {t) ((e*C(t) - c(t)) dt + a((t) dW(t)) + (1 - ({t))X^ c {t)r dt 

= X^ c (t)((r + fi((t)-c(t))dt + (T((t)dW(t)). (2.2) 

It simply says that the changes in wealth over time are due solely to gains/loses from 
investing in stock, from consumption and there is no cashflow coming in or out. This 
is usually referred to as the self-financing condition. 
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Under the regularity condition (12.11) imposed on and c(t) above, (12.21) . admits 
a unique strong solution given by the explicit expression 

X^ c {t) = X(0) exp (J (r + /i((u) - c{u) - KK>| 2 ) du + J a((u) dW(u) 

The initial wealth X^' c (0) = X(0) G (0, oo), is exogenously specified. 

2.3. Utility Function. All decision-makers have the same von Neumann-Morgenstern 
utility. This is crucial for understanding the model: time-inconsistency arise, not 
from a change in preferences, but from the way the future is discounted. All decision- 
makers try to maximize the discounted expectation of a function U : (0, oo) — > R 
strictly increasing and strictly concave, which is their (common) utility. We restrict 
ourselves to utility functions which are continuous differentiable and satisfy the Inada 
conditions 

U'(0+) = lim U'(x) = oo, [/'(oo) = lim U'{x) = 0. (2.3) 

We shall denote by /(•) the (continuous, strictly decreasing) inverse of the marginal 
utility function U'(-), and by (12.31) 

1(0+) = liml(z) = oo, I(oo) = lim I(x) = 0. (2.4) 

The agent is deriving utility from intertemporal consumption and final wealth. Let U 
be the utility of intertemporal consumption and U the utility of the terminal wealth 
at some non-random horizon T (which is a primitive of the model and we may alow 
it to be infinite). 

2.4. Discount Function. Unlike other works in this area we do not restrict our- 
selves to the framework of exponential discounting. Following [5 J , a discount function 
h : [0, oo] — > R is assumed to be continuously differentiable, with: 

h(0) = 1, h(s) > 0, 

and 

/■oo 

/ h(t) dt < oo. 
Jo 

The definition and characterization of equilibrium strategies (Theorems 13.21 and I4.2p 
hold for general discount functions. We will then particularize them to two special 
cases, which we will call pseudo- exponential discount functions. They are of two 
types, type I: 

hi (t) = Aexp(-pit) + (1 - A) exp(-p 2 t) (2.5) 

and type II: 

h 2 {t) = {1 + Xt)exp{-pt) (2.6) 



Pseudo-exponential discount rates were first considered in the context of time- 
inconsistency in [5]. Note that, in contrast to the more studied case of hyperbolic 
discount, decision-makers discount the distant future less heavily than the immediate 
future. It will be clear from our approach how to handle other cases. 

2.5. Objective. We conclude this section by formulating our problem. The objec- 
tive is to find time consistent policies and the optimal ones may fail to have this 
feature. Indeed, if the agent starts with a given positive wealth x, at some instant t, 
her optimal policy process {Ct( s )>Ct( s )}ue[t,T] is chosen such that 

eT 



sup E 



h(u - t)U(c(u)X ( ' c (u)) du + h(T - t)U(X^ c (T)) 



= E J h{u-t)U{c{u)X^~ c {u))du + h{T - t)U{X c ' £ {T)) 
The value function associated with this stochastic control problem is 

•T 



V(t, s, x) 



sup E 



h(u - t)U{c{u)X^ c {u)) du + h(T - s)/7(X c ' c (T)) 



X(s) 



x 



t < s <T, and it solves the following Hamilton- Jacobi-Bellman equation 
dV 



-(t,s,x) + sup 
os C;C 



, s dV, s 1 2 ,2 2 d 2 V. 
[r + tK- c)x—(t, s, x) + -a ( x yjC*, s, x) 



h'(s - 1) 

4-V(t, s, x) + U(xc) 



0. 



h(s - 1) 
with the boundary condition 

V(t,T,x) = U(x). (2.7) 
The first order necessary conditions yield the t— optimal policy {Ct(s),c t (s)} se [ tt T] 



(t(s,x) 



Ct(S,X 



V%(t,s,x) 

2 9 2 V (+ \ 
G x q x 2 yf'i S, X) 



t<s<T, 



t<s<T. 



X 



(2.8) 



(2.9) 



Therefore, unless the discounting is exponential (in which case H- = constant, so 
there is no t dependence in the HJB), the t— optimal policy may not be optimal after 
t. That is 

eT 



{(t(s),Ct(s)} sE [t>,T] ^ argmax Cc E 



h{u - t')U(c(u)X c ' c {u)) du + h(T - t')U(X^ c (T)) 



s 



for some subsequent instant t' , so the decision-maker would implement the t— optimal 
policy at later times only if she is constrained to do so. This failure to remain optimal 
across times can be regarded as time inconsistency. 

Because of time-inconsistency, optimal solutions are irrelevant in practice (al- 
though they do exist mathematically) and one must look for an alternative way 
to solve the problem. This will be done by considering equilibrium policies, that is, 
policies such that, given that they will be implemented in the future, it is individually 
optimal to implement them right now. Following [5] consider: 

- Fx(s,x) F 2 (s,x) 

C(s,x) = , c{s,x) = , (2.10) 

X X 

for some functions F 2 and the equilibrium wealth process {X(s)} sG [ 0; t] evolves 
according to 

dX(s) = [rX(s) + /iFi(s,X(s)) - F 2 (s,X(s))]ds + (rF 1 (s,X(s))dW(s). (2.11) 

The functions F\, F 2 are chosen such that on [t, t+e] it is optimal (this is made precise 
in our formal definition of equilibrium policies) to pick C(£,x) = Fl ^ ,a ^ , c(t, x) = 

F2 ^'^ , given the agent's wealth at time t is x, and for every subsequent instance 
s> t + e she follows f l2~TU|) . 

3. Finite Horizon 

3.1. General Discount Function. Let T be a finite time horizon exogenously 
specified. In general, for a policy process {C( s )> c ( s )}se[o,T] satisfying (12 .ip and its 
corresponding wealth process {X(s)} sG [ 0j t] (see f)2.2p ) we denote the expected utility 
functional 

rT 



J(t,xX,c)±E 



h(s - t)f/(c(s)X*' x (s)) ds + h(T - t)U(X l ' x (T)) 



(3.1) 



We shall give a rigorous mathematical formulation of the equilibrium policies in the 
formal definition below. 

Definition 3.1. A map F — (Fx, F 2 ) : (0, oo) x[0,T]^lx [0, oo) is an equilibrium 
policy for the finite horizon investment-consumption problem, if for any t, x > 

lim J(t,x,F u F 2 )-J(t,x,( e ,c e ) 

40 e 

where 

J(t,x,F 1 ,F 2 ) = J(t,x,C,c), 
F l{ s,X(s)) F 2 (s,X(s)) 
sv ' X(s) v ; X(s) K ' 



and {C( s )j c ( s )}se[t,T} should satisfy (12.11) . The equilibrium wealth process {X(s)} s£ [ tj T} 
is a solution of the stochastic differential equation (SDE) 

dX(s) = [rX(s) + nFi(s,X(s)) - F 2 (s,X(s))]ds + aF^s, X(s))dW(s). (3.4) 

The process {( £ (s),c t (s)} s€ [ t ,T] is another investment-consumption policy defined 

by 




with i5 e)f = + e], and {((s), c(s)} se[ttT] is any policy for which {(e(s),c t (s)} se[t . T] 
satisfy (TO) . 

The basic idea is the following: the agent (self) has a different rate of impatience 
jM- as times goes by (unless the discounting is exponential) and can be regarded 
as a continuum of agents (selves); at every instant t she is building a coalition with 
her immediate selves s, with s G [t, t + e] and try to maximize expected utility of 
intertemporal consumption and terminal wealth given that the selves on [t + e, T] 
agreed upon an equilibrium strategy. 

Our next item in the agenda is to characterize the equilibrium policies by a means 
of some adjoint processes defined by a flow (one for every instant t) of backward 
stochastic differential equations (BSDE). More precisely for every < t < T the 
processes {M(t, s), N(t, s)} sG [ tj T] are a solution of the BSDE 

dM{t, s) = - (M{t, s) (r + fi^s, X(s)) - X(s))) + a^{s, X(s))N(t, s)+ 

+h{s - t)^(s, X(s))U'(F 2 (s, X(s))Yj ds + N(t, s)dW(s) 

[M(t,T) = h(T-t)U'(X(T)), 

(3.7) 

where the equilibrium wealth process {X(s)} sG [ 0i t] follows (13.41) . 

The next central result does not depend on the choice of the discount function h. 

Theorem 3.2. Assume there exists a map F = (F 1 , F 2 ) : (0, oo) x [0, T] — > Rx (0, oo), 
continuously differentiable with respect to x such that for every t e [0, T] there exists 
a solution {M(t, s), N(t, s)} se [ t ,T] of (13. 7p which satisfies 

fiM(t,t) + cN{t,t) =0, (3.8) 
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and 

F 2 {t,x) = I{M{t,t)\X{t) = x). (3.9) 
Then F is an equilibrium strategy. 

Appendix A proves this Theorem. 

3.2. Special Discount Functions. Following j5] we would like to give a character- 
ization of the equilibrium policies in terms of both a partial differential equation and 
an integral equation. We restrict ourselves to three types of discounting although 
our method goes far beyond. We consider exponential, and pseudo-exponential dis- 
counting. Let us introduce the Legendre transform of — U{— x) 

U(y)±mv[U(x)-xy] = U(I(y))-yI(y), < y < oo. (3.10) 

The function [/(•) is strictly decreasing, strictly convex and satisfies the dual rela- 
tionships 

U'{y) = -x iff U'{x) = -y. (3.11) 
The next Theorem is our main result. It describes the equilibrium policies through 
a coupled system of parabolic equations. 

Theorem 3.3. Assume there exist two functions v(t, x) and w(t, x) three times con- 
tinuously differentiable which satisfy 

— (t, x) + rx—{t, x) - + U \^—(t, x)j= aij v(t, x) + /3 ljW (t, x), 

dw, , ( T (dv, X\dw, , fj, 2 ^{t,x)^{t,x) 

— (t, x) + [rx - I — (t, x)\ -=-(*, x) - !-^ dxK ' JdxK 
dt K J V \dx K J JJ dx K } a 2 

u 2 \^(t } x)} 2 p§(t } x) 

*° [|?(M)] 2 
for all (t,x) G [0,T] x (0,oo), with boundary condition 

v(T,x) = U(x), w(T,x) — 0. 
Then F = F 2 ) given by 

F 1 (t,x) = - ^l\ X \ , F 2 (t,x) = l(^(t,x)), te[0,T], (3.12) 

is an equilibrium policy. The coefficients a^-, (3ij corresponds to different choices of 
discount functions. Thus for exponential discounting 

aio = 5, a 20 = 0, (3 W = 0, p 20 = 0, 
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for type I 

«n = Xp 1 +(l-X)p 2 , «2i = P1—P2, Pn = X{l-X)(p 1 -p 2 ), P21 = Ap 2 +(1-A)pi, 
and for type II 

"12 = p - A, a 22 = -A, /?i2 = A, /3 2 2 = p + A. 
Appendix B proves this Theorem. 

Remark 3.4. Let us point out that for the case of exponential discounting the equi- 
librium policy coincide with the optimal one given by dynamic programming. Note 
that the value function v 

r-T 



v(t, x) = supE 



- <5(s -' ) f/(c(s)X^ c (s)) als + e- <5(T -' ) f>(X c ' c (T)) 



X(t) 



x 



and w = satisfies the PDE system of Theorem 13. 31 



(3.13) 



The next Proposition is in the spirit of Theorem 2 in [5] and gives a description of 
the equilibrium policy through an integral equation. It is stated only for the special 
discounts. 

Proposition 3.5. Assume there exist two functions v(t,x) and w(t,x) three times 
continuously differentiable which solve the PDE system of Theorem \3.tA Then v(t, x) 
satisfy the integral equation 

r-T 



v(t, x) 



E 



h(s - t)U(F 2 (s, ds + h(T - t)U(X ' X (T)) 



(3.14) 



Recall that {X(s)} se [o,T] is the equilibrium wealth process and it satisfies 

dX(s) = [rX(s) + fj,Fi(s, X(s)) - F 2 (s, X(s))]ds + aF^s, X(s))dW(s). (3.15) 

Appendix C proves this Proposition. 

Remark 3.6. This Proposition suggests that for a general discount function h(t) the 
equilibrium policies are of the form (13.121) for a function v as in (13.141) . It is not hard 

to see it holds for h(t) = (1 + Ant + \ l2 t 2 H ) exp(-pit) + (1 + \ 2 \t + A 22 t 2 + 

■■■)exp(-p 2 t). 

In the next Proposition stochastic representations for v and w are given. 

Proposition 3.7. Assume there exist two functions v(t,x) and w(t,x) three times 
continuously differentiable which solve the PDE system of Theorem \3.3[ Then 



w(t, x) = E 



a 2j / exp(-p 2 j(s-t))v(s,X t ' x (s))ds 



12 



v(t, x) 



E 



exp(— aij(s — t)) 



ds + h(T-t)U(X t<x (T)) 



Therefore 



v(t, x) 



E 



exp(— ay(s — t)) 



U I 



dv 



s,X 



t,x I 



-a 2 jPij / exp(— a 2j (z - s))v(z, X ,x (z)) dz 



dx 

ds + h(T -t)U{X^ x {T)) 



Proof: It is a direct consequence of Feynman-Kac's formula. 

□ 

The main question (since all the results of this section are based on it) is: when 
does the system of Theorem 13.31 have solutions? Of course the answer is known for 
exponential discounting. Although there is work in progress, we only have a partial 
answer at the moment and that is for CRRA preferences, U(x) = U(x) = — . In 
this case one can disentangle time and wealth and look for v(t,x) = f(t)— and 
w(t,x) = g(t)—, where f(t), g(t) solve the ODE system 



f(s) + Kf(s) + (l-p)[f(s)}^ = a lj f(s)+p lj g(s), 



(3.16) 



g'(s)+Kg(s)-pg{ S )[f(s)}^ = (3 2j f{s) + {3 2j g{s), (3.17) 
for all t G [0, T] with boundary condition 

f(T) = 1, g(T) = 0, 

2 

where K = rp + 2a ^(v-i) • Existence of this is obvious on small intervals (up to 
explosions) [T — e, T] for some e > 0. One can get a global result for small /3y (small 
A) by the Implicit Function Theorem. 

4. Infinite Horizon 

We next investigate stationary equilibrium policies and this is done in the infinite 
horizon framework. Before engaging into the formal definition let us point the follow- 
ing key fact. For a time homogenous policy process {((t), c(t)} t6 [o j00 ) satisfying (12. ip 
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and its corresponding wealth process {X(t)} t6 [ 0jOO ) (see ( 12. 2p ) the expected utility 
functional J(x, (, c) satisfies 



J{x,C,c) 



E 
E 
E 



/i(s-t)£/(c(s)A^(s))(fe 
h(s)U(c(s)X t ' x (t + s))ds 
h(s)U(c(s)X°' x (s))ds 



(4.1) 



This is due to the fact that the processes {X t,x (t + s)} se [o,oo) and {X ' x (s)} s& [q :OO ) 
have the same P distribution. Following the case of finite horizon we have: 

Definition 4.1. A map F = (Fx, F 2 ) : (0, oo) — > R x [0, oo) is an equilibrium policy 
for the infinite horizon investment-consumption problem, if for any x > 



where 



eio e ~ 



J(x,F l ,F 2 ) = J(x,(,c),_ 

m = m F2{x{t)) 



(4.2) 



(4.3) 



X(t) ' w X(t) ' 
and {((t), c(t)}te[o,oo) should satisfy (12. ip . The equilibrium wealth process {X(t)} t6 [ 0iOO ) 
satisfies 

dX(t) = [rX(t) + [J,Fi(X(t)) - F 2 {X{t))]dt + aFx(X(t))dW(t). (4.4) 

The process {£,= (£), c e (t)}t g [o j0 o) is another time homogenous investment-consumption 
policy defined by 

'((t), te [0,oa)\E t 

C(t), t e £ e 



Ce(t) 
C £ (t) 



(4.5) 
(4.6) 



c(t), te[0,oo)\E e 
c(t), t E E e . 

Here E t C [0, oo) is a measurable set with Lebesque measure \E e \ = e, and {((t), c(t)} f6 [ 0)OO ) 
is any time homogenous policy for which {( e (t), c e (£)}i e [ 0iOO ) satisfies (12. ip . 
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In a first step we would like to describe the equilibrium policies in terms of the 
process {M(t), iV(£)} te [ 0jOO ), the solution of the infinite horizon BSDE 

'dM(t) = -(M(t)(fiFl(X(t)) - F&X(t)) + aF{(X(t))N(t) 
+h(t)F 2 '(X(t))U'(F 2 (X(t)))dt + N(t)dW(t) (4.7) 
M(oo) = 0, te[0,oo), 

and the equilibrium wealth process {X(i)} te r 0)OO ) it is given by (|4.4p . For more about 
infinite horizon BSDE see [3]. The next result is the infinite horizon counterpart of 
Theorem 13 .21 

Theorem 4.2. Assume there exists a map F = (iq, F 2 ) : (0, oo) -> f x (0, oo), con- 
tinuously differentiable with respect to x such that there exists a solution {M(t), N(t)} t e[o,oo) 
of (14. 7ft which satisfy 

(iM(0) + aN(0) = 0, (4.8) 

and 

F 2 (x) = I(M(0)\X(t) = x). (4.9) 
Then F is an equilibrium strategy. 

The proof is similar to Theorem 14.21 so it is skipped. 

□ 

As in the preceding Section we find the equilibrium policies for exponential, type 
I and type II discounting. With the coefficients cty and as in Theorem 13.31 we 
have the following formal result. 

Theorem 4.3. Assume there exist two functions v(x) and w(x) three times contin- 
uously differentiable which solve the following ODE system 

ff 2 v l2 (x) 

rxv'(x) - 2er 2 V n^ + U ( v '( x )) = a ij v ( x ) + 0ijw(x), 

. T . ... , [i 2 v' (x)w' (x) fi 2 [v'(x)] 2 w"(x) . , n , . 

(rx - I(v (x)))w (x) - — 2 + — 2 = a 2j v{x) + (3 2j w{x), 

for all x G (0, oo). If in addition the following transversality condition 

M(oo) = 0, (4.10) 
is satisfied, then F = (iq, F 2 ) given by 

^) = -4rrv F2(x) = I(v'(x)), (4.11) 
is an equilibrium policy. 

The proof follows as in Theorem 13.31 
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□ 

The next Proposition gives the description of equilibrium policies through an in- 
tegral equation (IE) as in the Proposition 13.51 

Proposition 4.4. Assume there exist two functions v(x) and w(x) three times con- 
tinuously differentiable which solve the ODE system of Theorem \4-3[ Then v(x) 
satisfies the integral equation 



h{t)U{F 2 {X^ x {t)))dt 



(4.12) 



v{x) = E 

The proof follows as in Proposition 14.31 

□ 



nential, type I, type II discounts and further investigate the case of U(x) — Lets 
look for the function v of the form v(x) = — , for a constant k which is to be found. 

9al1 

F x {x)= (1 _^L 2 , F 2 (x) = k^x, (4.13) 



4.1. CRRA Preferences. In this subsection we are still in the paradigm of expo- 
nential, type I, type II discounts and further 

look for the function v of the form v(x) = ^y- 
The equilibrium policies are linear in wealth 

Fi{x) = 

(1 -p)c 

and the corresponding wealth process is 

m = MO) exp ((r + ii^g -k±)t + J^-mt)) . (4.14) 

According to Proposition 14.41 the value function v should satisfy the integral equation 
(14.121) (and this is also sufficient to grant that Fi,F 2 of (14.131) is an equilibrium 
policy), which in this context becomes 

POO 

kik = / h{u)e lu du, (4.15) 
Jo 

where 

Let us treat the three cases independently. 
4.1.1. Exponential Discounting. With h(t) = exp(— <5£), (14.151) reads 

k^f(6-k) = l, (4.17) 

given that 

5 > k. (4.18) 
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Thus 



The condition 



P 



5 — rp — 



VP 



2(1 -p)a< 



p-i 



5 > (p V 0) 



P~ 



_l_ r 



2(1 -p)a 2 

should hold true in view of positivity of k. 
The adjoint process {M(t)} 4e [ 0jOO ) is given by 

M(t) = exp(-5t)v'(X(t)) 



(4.19) 



(4.20) 



kX(0) 



exp ( ( r(p 



i) + ^V -(p-i)^-^ + %) 



2(1 -p)a 2 



fcX(O) 



exp l — l r 



2a' 



t + £w(t) 

a 



whence the condition M(oo) = is automatically satisfied. 
Remark 4.5. Condition (14.201) is weaker than the transversality condition 

(2~P)P 2 



5>(p\/0) 



_l_ r 



(4.21) 



2(1 -p)a 2 

of [H] . If ( 14.211) is met the equilibrium policy coincide with the optimal policy given 
by dynamic programming. However if 

1 r (2 /,)/r 



(pVO) 



2(1 -p)a 2 



< 5 < (p V 0) 



2(1 -p)a 2 



_l_ r 



equilibrium policy still exists but one cannot prove verification for the optimal policy. 

4.1.2. Type I Discounting. If h{t) = A exp(— pit) + (1 — A) exp(— p 2 t), equation (14. 151) 
leads to 



k 1 -p 



A 



1 - A 



(4.22) 



pi-h p 2 -k 
given that 

Pi-k>0, i = 1,2. (4.23) 
This can be expressed as the positive root of the quadratic equation for z = k?- 1 

Q(z) = Az 2 + Bz + C = 0, (4.24) 

where 



A^l-p, 



(4.25) 
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5 = (2p-l) ^^-^+^4- ( Pl + P2 ), 



C = — 



V 



Pi — rp 



VP 



2(1 -p)o 2 



P2 - rp 



VP 



When pi = p 2 = S, the equation (14.241) becomes 



p)z 



pr 



VP 



2(1 -vWJ _ 



z H — ( 5 — pr 
V 



2(1 -p)a 2 



VP 

2(1 -p)a 2 J 



which compered to ( 14.191) brings in a new solution 



(4.26) 
(4.27) 

0, (4.28) 

!(l- fW j' (429) 

Thus when pi, p 2 are close to 5 the equation (14.241) has at least a positive solution 
provided that 5 satisfies (14.201) . It has two positive solutions if p < 0. 

Let us search for the transversality condition sufficient to grant (14. lOj) (M(oo) = 0). 
The process {M(t)} tg [ oo ) is given by 

M(t) = Xexp(- Pl t)v[(X(t)) + (1 - A) exj>(-p 2 t)v' 2 (X{t)), 

where 



Z6 



1 

V 



rp 



VP 



VAX 



E 



U(F 2 (Xin)ds 



1,2. 



(4.30) 
(4.31) 



Consequently 



VAX) 



k 1 ~px p 
V(pi ~ k) 



1,2. 



(4.32) 



Furthermore, for i = 1, 2 



X(0)k— 
(pi - k) 



exp 



expi-PisWiiXit)) 



r(p - 1) + 



(2p-iy 

2(1 -p)a 2 



(p - l)fc5=T -Pi)t+ -W(t) 



Pi 



a 



Therefore the transversality condition reads 



1 



1 — p 



Pi ~ 



(2p-l)p 
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1,2. 



(4.33) 



2(1 -p)a 2 _ 

In general it is not obvious if the equation (14.241) has a positive solutions for which 
the transversality condition and (14.231) are met. However this is the case when the 
Pi are close to 5 and 5 satisfies (I4.20p . 



IS 

It is interesting to note that in some cases there may be two equilibrium policies. 
Let us illustrate this in an example: we take p > |, P—\ and denote y = r+ 2 (i- p ) a 2 ■ 

Moreover, for some small e > let p\ = py + e, p 2 = py — e, so that B = e( - 1 ~ 2A ^ , and 

C = — . If A > x „ i+v 7 ^) the roots of <_ > e so 

that f)4.23p holds true. The transversality condition (14.331) is satisfied if e is chosen 
small enough. 

4.1.3. Type II Discounting. If h(t) = (1 + Xt) exp(— pi), equation (I4.15P leads to 

i 1 A 

= =r + (4.34) 

p — k (p — k) 2 

given that 

p-k>0. (4.35) 
This can be expressed as the positive root of the quadratic equation for z = kp- 1 

Q(z) = Az 2 + Bz + C = 0, (4.36) 

with 

A = l-p, (4.37) 



The transversality condition in this case is 

(2p~l)p 2 



1 — p 



2(1 -p)a 2 



(4.40) 



Similar to the type I discounting it is possible to establish a positive solution of 
(14.361) which satisfy f!4.35j) and f)4.40p when A is close to zero. 



5. Concluding Remarks 

This paper introduced a novel concept in stochastic optimization namely the notion 
of equilibrium policies. We analyze the Merton portfolio management problem in 
the context of exponential and non-exponential discounting. Although type I, II 
and exponential discounting are considered only, our methodology can be extended 
for more general discount functions. The optimal policies are characterized by both 
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an integral equation and a system of partial differential equations. The infinite 
horizon case is covered as well and it is shown that in some situations there are more 
equilibrium policies. Moreover for CRRA preferences the equilibrium policies are to 
consume and invest in the risky asset a constant proportion of the corresponding 
wealth, which is similar to the optimal policy for exponential discounting, but the 
constants are different. 

6. Appendix 

A Proof of Theorem [3721 

We assume U = r = to ease the presentation. Let {X e (s)} se [ 0i T] be the wealth 
corresponding to {CO), c e (s)} se[0 ,T] i.e., 

dX e (s) = X £ (s)((r + Ke(s) - c e (s)) ds + a( e (s) dW(s)). (6.1) 

The processes {y e (s)} se [o,T] and {Z e (s)} se [o,T] defined by the SDE 

dY\s) = Y*(s)(^(s, X(s)) - 9 -§(s, X(s))) ds + a[Y*(t)?£(s, X(s) + 
+ (X(s)((s)-F 1 ( S ,X(s)) X EMdW(s) (6.2) 

r e (o) = o 

and 

f dZ%s) = [Z%s)(^(s,X(s)) - ^( s ,X(s)))+ 
+ (/iA > ( S )(C(s) - ^(s,X(s))) - (c(s)X(s) - F 2 ( S ,X( S )) XEe (s)]ds+ 
< +[aZ^s) d §(s,X(s)) + (aY*(s)(((s) - ^(s, X(s)) X eM <W{s) 
Z<(0) = 

can be regarded as first order and second order variation of {X(s)} se [ 0> r] 
At this point we need the following Lemma from [TJ]. 

Lemma 6.1. For any k > 1 

sup E\X e {s) - X{s)\ 2k = 0{e k ), (6.3) 

sS[0,oo] 

sup E\Y e (s)\ 2k = 0(e k ), (6.4) 

«e[o,oo] 

sup E\Z e (s)\ 2k = 0(e 2k ) } (6.5) 

se[o,oo] 

sup E\X e {s) - X(s) -Y e {s)\ 2k = 0{e 2k ) (6.6) 
se[o,oo] 
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sup E|X e (s) - X(s) - Y e (s) - Z e (s)\ 2k = o(e 
se[o,oo] 

In the light of this Lemma the following expansion holds: 

J(t,x,( e ,c e ) = J(t,x,F 1 ,F 2 ) 



2k \ 



(6.7) 



+E 



h(s - t)(U(c(s)X^(s)) - U(F 2 (X^(s))) XEe (s) 



dF 2 



+h(s - t)(Y%s) + Z%s))^(s, X^{s))U> 



,fdF 2 



dx 



s,JP'*(s)) ds + o(e 



In the above equation we would like to get rid of {Y t (s)} se [t,T], {Z e ( s )}se[t,T] and 
the way to accomplish this is by using the adjoint processes {M(t, s), N(t, s)} ae [ t) T\ 
and integration by parts. It turns out 



Y e (s 



dF 2 

dx 



s,X(s))U' 
dFi 



, fdF 2 



dx 



s,X(s)) ds = -Y e (s)dM(t,s) 



Y*(s)M(t,s)[^(s,X(s))-^ 



(s,X(s))+aY%s)N(t,s)^(s,X(s)) 



ds 



+Y € (s)N(t,s) dW{s) 



and 



ZHs) 



d_F\ 

dx 



s,X(s))U> 
dFi 



,fdF 2 



dx 



s,X(s)) ds = -Z e (s)dM(t,s) 



Z*(s)M(t,s)[^(s,X(s))-^ 



dF 

(s,X(s) )+aZ*(s)N(t,s)-^-(s,X(s)) 



ds 



+Z e (s)N(t,s) dW{s), 



Consequently 



dF 2 



E I h(s-t)(Y*(s) + Z*(s))-^-(s,X(s))U' 



,[dF 2 



dx 



s,X(s))) ds 



-E 



-E J + Z e (s))dM(t } s) 

(Y*(s) + Z*(s))M(t, s) L^±{a, X(s)) - ^(s, X(s) ) + 



dF 

a(Y^s) + Z*(s))^(s,X(s))N(s) 
Moreover by (16.21) and the equation following it 



ds. 
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E J (Y*( a ) + Z*(s))M(t, s) (l*^±(a, X(s)) - ^-(s, X(s)j ds = 

eJ M(t,s)d(Y e (s) + Z e (s)) 

-Ej T [M{t,s)(vX{s)(as) - F 1 (s,X(s))) - (X(s)c(s) - F 2 {s, X{s)))] XEe {s)) ds 
Thanks to Ito's formula 

d(M(t, s){Y t {s) + Z e (s)) = M{t, s)d{Y t (s) + Z £ (s)) + {Y e {s) + Z e (s))dM(t, s) 

+ d(Y £ (s) + Z e (s))dM{t,s), 
which in conjunction with the previous equations yield 

h(s - t)-^(s, X(s))U'(F 2 (s, X(s)))(Y e (s) + Z e {a)) ds 

= E J [M{t, s)fj,{X(s)({ S ) - F 1 (X(s))] ds 

+E J T [{X{s)c{s) - F 2 (X{s))) - {aN(t, s)(X{s)((s) + F 1 (X (s)))] X eM ds 

POO 

-E / aM(t, s)Y*(s)((as) + F 1 (X (s))) X eXs) ds. 
Jo 

Next we introduce the Hamiltonian function H by 

H(t, s, x,u,m,n) = mx{Qjj, — c) + nxQa + h(s — t)U(cx), u=((,c). (6.8) 
The asymptotical expansion leads to 

J(t, x, F 1} F 2 ) - J(t, x, Ce, c e ) EjT *M{t, s)Y*{s) (j^{s, X^(s))-C(s)^ X eM ds 

+E J [H(t,s,X^(s),as),c(s),M(t,s),N(t,s)) 

-H(t, s, X^{8), C(s), c(s), M(t, s),N(t, s))] XEe (s) ds + o(e). 
Additionally, with sufficient integrability assumptions and (16.41) 

E g aM(t, s)Y*(s) X^(s)) - {(a)) XE( (s) ds 

lim 

whence 



lim i = 0, (6.9) 

ej.0 e 
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H[t,t, x, M(t, 0, N(t, t) ) -H(t, t, x, C(t), c(t), M(t, t), N(t, t)) 



elO e 

r / FJt.x\ F 

= E 

Therefore a sufficient condition for F = (Fx, F2) to be a equilibrium policy is 

f F 1 (t,x) ^ F 2 (t,x) \ =argmax ji(t t x £ ^M(t,t),N(t,t)), (6.10) 

\ X X J ^' 

Finally, the linearity of H in £ implies 

fiM(t,t) + <rN(t,t) = 0, (6.11) 
and the first order conditions for c (which are also sufficient due to concavity of 

U) 

F 2 (t,x) = I(M(t,t)\X(t) = x). (6.12) 

□ 

B Proof of Theorem 1X51 

The result is established for all three cases (exponential, type I, type II) separately. 

Exponential discounting: h(t) = e~ 5t 

Let v(t, x) be the solution of the classical HJB 

^ x)+rx ^ x) .A^ + u{^ (t ^ =fc(tiX) , (6 , 3 ) 

for all (t,x) G [0,T] x (0, 00), with boundary condition 

v(T, x) = U (x). 

Then v(t, x) and w(t, x) = solve the parabolic system. 
We show the equilibrium strategies are 

F 1 (t,x) = - ^; X \ , F 2 (t,x) = l(^(t,x)), te[0,T\. (6.14) 



- 2 0M)' ^ \dx 

Indeed lets consider the processes 

f^n eft 11 

M(t,s) = e- s ^ — (s,X(s)), N(t,s) = ae- 5 ^F 1 (s,X(s)) — (s,X(s)), (6.15) 

s G [t, T]. Recall that the equilibrium wealth process {X(s)} s< =[ 0j t}, is defined by 
dX(s) = [rX(s) + fiFi(a, X(s)) - F 2 (s, X(s))}ds + aF x (a, X(s))dW(s). (6.16) 
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It is a matter of direct calculations to prove that {M(t, s), N(t, s)} s6 [t,T] solves BSDE 
(13.71) . Next we observe 

pM{t,t) + aN(t,t) = 0, (6.17) 

and 

F 2 {t,x) = I{M{t,t)\X{t) = x), (6.18) 
so by Theorem I3.2[ F = (Fx, F 2 ) is an equilibrium strategy. 

□ 

Type I discounting: h(t) = Aexp(— pxt) + (1 — A) exp(— p 2 t) 

Let v and wbea solution of the PDE system. We show the equilibrium strategies 



are 



Fx(t,x) 



f dv \ 

, F 2 (t,x) = ll— (t,x)J, te[0,T\. (6.19) 



Let us define the process {M(t, s), N(t, s)} 8 £[t,T\ by 



M(t, s) 4 A exp(- Pl ( S - t))^(s, X(s)) + (1 - A) exp(-p 2 (s - t))^(s, X(a)), 

ox ox 

and 

N(t, s) 4 Xa exp(- Pl (s - t))Fx(s, X(s))-^(s, X(s)) 

r\ 2 

+(l-X)aeM-P2(s-t))Fx(s,X(s))-^(s,X( S )), 

for some functions Vx and v 2 which will be specified later on and {X(s)} sg [ 0> T] is 
the equilibrium wealth process of ( 16.161) . By requesting {M(t, s), N(t, s)} s e[t,T] to 
solve BSDE (13. 7ft . one finds a PDE system for (vx, v 2 ). 

Indeed, on one hand by Ito's formula 



dM(t,s)= I Aexp(-pi(s -t)) 



d 2 Vx 



(7 



d 3 vx 



+(pFx(s,X(s)) - F 2 (s,X(s)))^(s,X(s)) + -F^s, X(s))^(s, X(s)) 



+(1 - A) exp(-p 2 (s - t)) 

d 2 v 2 



p 2 ^l( s ,X(s)) + ^(s,X(s))+ 



dx 



dxds 



+(p,Fx(s,X(s))-F 2 (s,X(s))) 



On the other hand from (13. 7)) 



dx2 ,s,X(s)) + ^F?(s,X(s))^(s,X(s)) 
+N(t,s)dW(s). 



ds 
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dM(t,s)= ( Aexp(-pi(s - t)) 
dF l 



dx 



dF 1 



dF 2 



s,X(s)) n— -(s,X(s)) - ^(s,X(s) + 



d 2 Vl 



dx 



dx 



+a 2 F 1 (s,X(s))^(s,X(s))^(s,X(s)) + -^(s,X(s))U'(F 2 (s,X(s))) 



dx 

+(1 - A) exp(-p 2 (s - t)) 
dF 1 



dx 2 
dv 2 
dx 
d 2 v 2 



dx 



+ 



s,X(s))[^(s,X(s))-^( S ,X(s)) + 



dF 2 



+a 2 F 1 (s, X(s))-^(s, X(s))-^(s, X(s)) + -^(s, X(s))U'(F 2 (s, X(s))) 

+N(t,s)dW(s). 
The two representations of dM(t, s) yield 



ds+ 



and 



dv\ d 2 v\ d 2 v\ 



d 3 V! 



dx 3 



dF 1 



dF 2 



+— F 1 (s, a;)— —(a, x) = - ^x— — (s, x) - — — (s, x) — (s, x) 



<9x 



dx 



dv 



dx 



2 , ,dF x d 2 v x dF 2 dv 

-a Fl (s, x ) — (s, x) — (s, x) - —{s, x)-(s, x), 



dv 2 , d 2 v 2 . , , . , . ,,d 2 v 2 . . 
"P2 — (s, a;) + tttttI 5 ' x ) + - ^(s, x))— — (s, x) 



<9x 



<9 3 v 2 



dxds 



+—F l (s, x)-q-z(s, x) = - ( /i— (s, x) 



9Fi 



5x 



<9x 2 

<9F 2 „ , di> 



a Fi(s,x)— — (s, x) (s, x) — (s, x) — — (s, x). 



dx 



dx 2 



dx 



dx 



This can be rewritten as 
d_ 

dx 



dvi . . . . . . . . di>i 

— (s,x) - piwi(s,x) + (/iFi(s,x) - F 2 (s,x)J — (s,x) 



d 2 vi 



= 0, 



and 



<9x 



dv 2 . . . . . . . . 9f 2 

— (s, x) - p 2 V2(s, a;) + - ^(s, x)J — (s, x) 



cr 2 , <9 2 l> 2 



s,x) + C/(F 2 (s,x)) 



0. 
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Recall that v and w is a solution of the PDE system. Thus 

Vx = v + (1 — X)w, v 2 = v — Xw (6.20) 
satisfy the above PDEs. Moreover 

nM(t,t) + aN(t,t) = 0, (6.21) 

and 

F 2 (t,x)=I(M(t,t)\X(t) = x), (6.22) 
so by Theorem 13.21 F = (F±, F2) is an equilibrium strategy. 

□ 

Type II discounting: h(t) = (1 + At) exp(— pt) 

Let v and wbea solution of the PDE system. We show the equilibrium strategies 
are 



Fi(t,x) = - ^g;/ '- , F 2 (t,x) = l(—(t,x)), te[0,T]. (6.23) 



<r 2 Wr{t,x) \dx 
Let us define the process {M(t, s), N(t, s)} se [ tt T] by 

M(t, s) 4 exp(-p( S - t))^(s, X(s)) + Xt exp(-p( S - t))^(s, X(s)), 

ox ox 

and 

N{t,s)±aeM-Pi{s-t))F 1 {s,X{s))-^(s,X{s)) 



+a\tew{-p2(s-t))F 1 (s,X{s)) ir §-{s,X(s)), 



, d 2 v 2 
for the functions 

i>i = i>, t>2 — u — w. (6.24) 

As for the case of type I one can check that {M(t, s), N(t, s)} sS [ t) T] solves BSDE 
(13.71) . Moreover 

fiM(t,t) + aN(t,t)=0, (6.25) 

and 

F 2 (t, x) = I(M(t, t)\X(t) = x), (6.26) 
so by Theorem I3.2[ F = (F±, F 2 ) is an equilibrium strategy. 

□ 
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C Proof of Proposition 13.51 

If the discounting is exponential the result is a direct consequence of Feynman- 
Kac's formula. For type I, the functions v± and v 2 of (I6.20p admit the following 
stochastic representations 



Vi(t, x) = E 



J e- p ^ s - t) U{F 2 (X t t : x ))ds 



1,2, (6.27) 



by Feynman-Kac's formula. Consequently 

v(t,x) = Xv x (t,x) + (1 - X)v 2 (t,x) = E 
Similarly for type II, (16.281) holds true. 



J h{s-t)U{F 2 {X t f))ds 



(6.28) 



□ 
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